Objective assessment method for stereoscopic video quality based on wavelet transform

ABSTRACT

An objective assessment method for a stereoscopic video quality based on a wavelet transform fuses brightness values of pixels in a left viewpoint image and a right viewpoint image of a stereoscopic image in a manner of binocular brightness information fusion, and obtains a binocular fusion brightness image of the stereoscopic image. The manner of binocular brightness information fusion overcomes a difficulty in assessing a stereoscopic perception quality of a stereoscopic video quality assessment to some extent and effectively increases an accuracy of a stereoscopic video objective quality assessment. When weighing qualities of each frame group in a binocular fusion brightness image video corresponding to a distorted stereoscopic video, the objective assessment method fully considers a sensitivity degree of a human eye visual characteristic to various types of information in the video, and determines a weight of each frame group based on a motion intensity and a brightness difference.

CROSS REFERENCE OF RELATED APPLICATION

The present application claims priority under 35 U.S.C. 119(a-d) to CN201510164528.7, filed Apr. 8, 2015.

BACKGROUND OF THE PRESENT INVENTION

1. Field of Invention

The present invention relates to a stereoscopic video quality assessmentmethod, and more particularly to an objective assessment method for astereoscopic video quality based on a wavelet transform.

2. Description of Related Arts

With the rapid development of the video coding technology and displayingtechnology, various types of video systems have been increasingly widelyapplied and gained attention, and gradually become the research focus inthe information processing field. Because of the excellent watchingexperience, the stereoscopic video has become more and more popular, andthe applications of the related technologies have greatly integratedinto the current social life, such as the stereoscopic television, thestereoscopic film and the naked-eye 3D. However, during the process ofcapturing, compression, coding, transmission, and displaying of thestereoscopic video, it is inevitable to introduce different degrees andkinds of distortion due to a series of uncontrollable factors. Thus, howto accurately and effectively measure the video quality plays animportant role in promoting the development of the various types of thevideo systems. The stereoscopic video quality assessment is divided intothe subjective assessment and the objective assessment. The key of thecurrent stereoscopic video quality assessment field is how to establishan accurate and effective objective assessment model to assess theobjective quality of the stereoscopic video. Conventionally, most of theobjective assessment methods for the stereoscopic video quality merelysimply apply the plane video quality assessment method respectively forassessing the left viewpoint quality and the right viewpoint quality;such objective assessment methods fail to well deal with therelationship between the viewpoints nor consider the influence of thedepth perception in the stereoscopic video on the stereoscopic videoquality, resulting in the poor accuracy. Although some of theconventional methods consider the relationship between the two eyes, theweighting between the left viewpoint and the right viewpoint isunreasonable and fails to accurately describe the perceptioncharacteristics of the human eyes to the stereoscopic video. Moreover,most of the conventional time-domain weightings in the stereoscopicvideo quality assessment are merely a simple average weighting, while infact the time-domain perception of the human eyes to the stereoscopicvideo is not merely the simple average weighting. Thus, the conventionalobjective assessment methods for the stereoscopic video quality fail toaccurately reflect the perception characteristics of the human eyes, andhave the inaccurate objective assessment results.

SUMMARY OF THE PRESENT INVENTION

An object of the present invention is to provide an objective assessmentmethod for a stereoscopic video quality based on a wavelet transform,the method being able to effectively increase a correlation between anobjective assessment result and a subjective perception.

Technical solutions of the present invention are described as follows.

An objective assessment method for a stereoscopic video quality based ona wavelet transform comprises steps of:

{circle around (1)} representing an original undistorted stereoscopicvideo by V_(org), and representing a distorted stereoscopic videoto-be-assessed by V_(dis);

{circle around (2)} calculating a binocular fusion brightness of eachpixel in each frame of a stereoscopic image of the V_(org); denoting thebinocular fusion brightness of a first pixel having coordinates of (u,v)in an f th frame of the stereoscopic image of the V_(org) as B_(org)^(f)(u,v), B_(org) ^(f)(u,v)=√{square root over ((I_(org)^(R,f)(u,v))²+(I_(org) ^(L,f)(u,v))²+2(I_(org) ^(R,f)(u,v)×I_(org)^(L,f)(u,v)×cos ∂)×λ)}; then according to the respective binocularfusion brightnesses of all the pixels in each frame of the stereoscopicimage of the V_(org), obtaining a binocular fusion brightness image ofeach frame of the stereoscopic image in the V_(org); denoting thebinocular fusion brightness image of the f th frame of the stereoscopicimage in the V_(org) as B_(org) ^(f), wherein a second pixel having thecoordinates of (u,v) in the B_(org) ^(f) has a pixel value of theB_(org) ^(f)(u,v); according to the respective binocular fusionbrightness images of all the stereoscopic images in the V_(org),obtaining a binocular fusion brightness image video corresponding to theV_(org), denoted as B_(org), wherein an f th frame of the binocularfusion brightness image in the B_(org) is the B_(org) ^(f); and

calculating a binocular fusion brightness of each pixel in each frame ofa stereoscopic image of the V_(dis); denoting the binocular fusionbrightness of a third pixel having the coordinates of (u,v) in an f thframe of the stereoscopic image of the V_(dis) as B_(dis) ^(f)(u,v),B_(dis) ^(f)(u,v)=√{square root over ((I_(dis) ^(R,f)(u,v))²+(I_(dis)^(L,f)(u,v))²+2(I_(dis) ^(R,f)(u,v)×I_(dis) ^(L,f)(u,v)×cos ∂)×λ)}; thenaccording to the respective binocular fusion brightnesses of all thepixels in each frame of the stereoscopic image of the V_(dis), obtaininga binocular fusion brightness image of each frame of the stereoscopicimage in the V_(dis); denoting the binocular fusion brightness image ofthe f th frame of the stereoscopic image in the V_(dis) as B_(dis) ^(f),wherein a fourth pixel having the coordinates of (u,v) in the B_(dis)^(f) has a pixel value of the B_(dis) ^(f)(u,v); according to therespective binocular fusion brightness images of all the stereoscopicimages in the V_(dis), obtaining a binocular fusion brightness imagevideo corresponding to the V_(dis), denoted as B_(dis), wherein an f thframe of the binocular fusion brightness image in the B_(dis) is theB_(dis) ^(f); wherein:

1≦f≦N_(f), wherein the f has an initial value of 1; the N_(f) representsa total frame number of the stereoscopic images respectively in theV_(org) and the V_(dis); 1≦u≦U,1≦v≦V, wherein the U represents a widthof the stereoscopic image respectively in the V_(org) and the V_(dis),and the V represents a height of the stereoscopic image respectively inthe V_(org) and the V_(dis); the I_(org) ^(R,f)(u,v) represents abrightness value of a fifth pixel having the coordinates of (u,v) in aright viewpoint image of the f th frame of the stereoscopic image of theV_(org); the I_(org) ^(L,f)(u,v) represents a brightness value of asixth pixel having the coordinates of (u,v) in a left viewpoint image ofthe f th frame of the stereoscopic image of the V_(org); the I_(dis)^(R,f)(u,v) represents a brightness value of a seventh pixel having thecoordinates of (u,v) in a right viewpoint image of the f th frame of thestereoscopic image of the V_(dis); the I_(dis) ^(L,f)(u,v) represents abrightness value of an eighth pixel having the coordinates of (u,v) in aleft viewpoint image of the f th frame of the stereoscopic image of theV_(dis); the ∂ represents a fusion angle and the λ represents abrightness parameter of a display;

{circle around (3)} adopting 2^(n) frames of the binocular fusionbrightness images as a frame group; respectively dividing the B_(org)and the B_(dis) into n_(GoF) frame groups; denoting an i th frame groupin the B_(org) as G_(org) ^(i); and denoting an i th frame group in theB_(dis) as G_(dis) ^(i); wherein: the n is an integer in a range of

${\left\lbrack {3,5} \right\rbrack;{n_{GoF} = \left\lfloor \frac{N_{f}}{2^{n}} \right\rfloor}},$wherein the └ ┘ is a round-down symbol; and 1≦i≦n_(GoF);

{circle around (4)} processing each frame group in the B_(org) with aone-level three-dimensional wavelet transform, and obtaining eightgroups of first sub-band sequences corresponding to each frame group inthe B_(org), wherein: the eight groups of the first sub-band sequencescomprise four groups of first time-domain high-frequency sub-bandsequences and four groups of first time-domain low-frequency sub-bandsequences; and each group of the first sub-band sequence comprises

$\frac{2^{n}}{2}$first wavelet coefficient matrixes; and

processing each frame group in the B_(dis) with the one-levelthree-dimensional wavelet transform, and obtaining eight groups ofsecond sub-band sequences corresponding to each frame group in theB_(dis), wherein: the eight groups of the second sub-band sequencescomprise four groups of second time-domain high-frequency sub-bandsequences and four groups of second time-domain low-frequency sub-bandsequences; and each group of the second sub-band sequence comprises

$\frac{2^{n}}{2}$second wavelet coefficient matrixes;

{circle around (5)} calculating respective qualities of two groups amongthe eight groups of the second sub-band sequences corresponding to eachframe group in the B_(dis); and denoting a quality of a j th group ofthe second sub-band sequence corresponding to the G_(dis) ^(i) as

$Q^{i,j},{Q^{i,j} = \frac{\sum\limits_{k = 1}^{K}{{SSIM}\left( {{VI}_{org}^{i,j,k},{VI}_{dis}^{i,j,k}} \right)}}{K}},$wherein: j=1,5; 1≦k≦K; the K represents a total number of the waveletcoefficient matrixes respectively in each group of the first sub-bandsequence corresponding to each frame group in the B_(org) and each groupof the second sub-band sequence corresponding to each frame group in theB_(dis);

${K = \frac{2^{n}}{2}};$the VI_(org) ^(i,j,k) represents a k th first wavelet coefficient matrixof a j th group of the first sub-band sequence corresponding to theG_(org) ^(i); the VI_(dis) ^(i,j,k) represents a k th second waveletcoefficient matrix of the j th group of the second sub-band sequencecorresponding to the G_(dis) ^(i); the SSIM( ) is a structuralsimilarity calculation function;

{circle around (6)} according to the respective qualities of the twogroups among the eight groups of the second sub-band sequencescorresponding to each frame group in the B_(dis), calculating a qualityof each frame group in the B_(dis); and denoting the quality of theG_(dis) ^(i) as Q_(GoF) ^(i), Q_(GoF)^(i)=w_(G)×Q^(i,1)+(1−w_(G))×Q^(i,5), wherein: the w_(G) is a weight ofthe Q^(i,1); the Q^(i,1) represents the quality of a first group of thesecond sub-band sequence corresponding to the G_(dis) ^(i); and theQ^(i,5) represents the quality of a fifth group of the second sub-bandsequence corresponding to the G_(dis) ^(i); and

{circle around (7)} according to the quality of each frame group in theB_(dis), calculating an objective assessment quality of the V_(dis) anddenoting the objective assessment quality of the V_(dis) as Q_(v),

${Q_{v} = \frac{\sum\limits_{i = 1}^{n_{GoF}}{w^{i} \times Q_{GoF}^{i}}}{\sum\limits_{i = 1}^{n_{GoF}}w^{i}}},$wherein the w^(i) is a weight of the Q_(GoF) ^(i).

The w^(i) in the step {circle around (7)} is obtained through followingsteps:

{circle around (7)}-1, calculating a motion vector of each pixel in eachframe of the binocular fusion brightness image of the G_(dis) ^(i)except a first frame of the binocular fusion brightness image, with areference to a previous frame of the binocular fusion brightness imageof each frame of the binocular fusion brightness image in the G_(dis)^(i) except the first frame of the binocular fusion brightness image;

{circle around (7)}-2, according to the motion vector of each pixel ineach frame of the binocular fusion brightness image of the G_(dis) ^(i)except the first frame of the binocular fusion brightness image,calculating a motion intensity of each frame of the binocular fusionbrightness image in the G_(dis) ^(i) except the first frame of thebinocular fusion brightness image; and denoting the motion intensitydegree of an f′th frame of the binocular fusion brightness image in theG_(dis) ^(i) as MA^(f′),

${{MA}^{f^{\prime}} = {\frac{1}{U \times V}{\sum\limits_{s = 1}^{U}{\sum\limits_{t = 1}^{V}\left( {\left( {{mv}_{x}\left( {s,t} \right)} \right)^{2} + \left( {{mv}_{y}\left( {s,t} \right)} \right)^{2}} \right)}}}},$wherein: 2≦f′≦2^(n); the f′ has an initial value of 2; 1≦s≦U,1≦t≦V; themv_(x)(s,t) represents a horizontal component of the motion vector of apixel having coordinates of (s,t) in the f′th frame of the binocularfusion brightness image in the G_(dis) ^(i), and the mv_(y)(s,t)represents a vertical component of the pixel having the coordinates of(s,t) in the f′th frame of the binocular fusion brightness image in theG_(dis) ^(i);

{circle around (7)}-3, calculating a motion intensity of the G_(dis)^(i), denoted as MAavg^(i),

${{{MA}\;{avg}^{i}} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{MA}^{f^{\prime}}}{2^{n} - 1}};$

{circle around (7)}-4, calculating a background brightness image of eachframe of the binocular fusion brightness image in the G_(dis) ^(i);denoting the background brightness image of an f″th frame of thebinocular fusion brightness image in the G_(dis) ^(i) as BL_(dis)^(i,f″); and denoting a pixel value of a first pixel having coordinatesof (p,q) in the BL_(dis) ^(i,f″) as BL_(dis) ^(i,f″)(p,q),

${{{BL}_{dis}^{i,f^{''}}\left( {p,q} \right)} = {\frac{1}{32}{\sum\limits_{{bi} = {- 2}}^{2}{\overset{2}{\sum\limits_{{bj} = {- 2}}}{{I_{dis}^{i,f^{''}}\left( {{p + {bi}},{q + {bi}}} \right)} \times {{BO}\left( {{{bi} + 3},{{bj} + 3}} \right)}}}}}},$wherein: 1≦f″≦2″; 3≦p≦U−2,3≦q≦V−2; −2≦bi≦2,−2≦bj≦2; the I_(dis)^(i,f″)(p+bi,q+bi) represents a pixel value of a pixel havingcoordinates of (p+bi,q+bi) in the f″th frame of the binocular fusionbrightness image of the G_(dis) ^(i); and the BO(bi+3,bj+3) representsan element at a subscript of (bi+3,bj+3) in a 5×5 background brightnessoperator;

{circle around (7)}-5, calculating a brightness difference image betweeneach frame of the binocular fusion brightness image and the previousframe of the binocular fusion brightness image of each frame of thebinocular fusion brightness image in the G_(dis) ^(i) except the firstframe of the binocular fusion brightness image; denoting the brightnessdifference image between the f′th frame of the binocular fusionbrightness image in the G_(dis) ^(i) and an f′−1th frame of thebinocular fusion brightness image in the G_(dis) ^(i) as LD_(dis)^(i,f′); and denoting a pixel value of a second pixel having thecoordinates of (p,q) in the LD_(dis) ^(i,f′) as LD_(dis) ^(i,f′)(p, q),LD _(dis) ^(i,f′)(p,q)=(I _(dis) ^(i,f′)(p,q)−I _(dis) ^(i,f′-1)(p,q)+BL_(dis) ^(i,f′)(p,q)−BL _(dis) ^(i,f′-1)(p,q))/2,wherein: 2≦f′≦2^(n); 3≦p≦U−2,3≦q≦V−2; the I_(dis) ^(i,f′)(p,q)represents a pixel value of a third pixel having the coordinates of(p,q) in the f′th frame of the binocular fusion brightness image in theG_(dis) ^(i); the I_(dis) ^(i,f′-1)(p,q) represents a pixel value of afourth pixel having the coordinates of (p,q) in the f′−1th frame of thebinocular fusion brightness image in the G_(dis) ^(i); the BL_(dis)^(i,f′)(p,q) represents a pixel value of a fifth pixel having thecoordinates of (p,q) in the background brightness image BL_(dis) ^(i,f″)of the f′th frame of the binocular fusion brightness image of theG_(dis) ^(i); and the BL_(dis) ^(i,f′-1)(p,q) represents a pixel valueof a sixth pixel having the coordinates of (p,q) in the backgroundbrightness image BL_(dis) ^(i,f′-1) of the f′−1th frame of the binocularfusion brightness image of the G_(dis) ^(i);

{circle around (7)}-6, calculating a mean value of the pixel values ofall the pixels in the brightness difference image between each frame ofthe binocular fusion brightness image and the previous frame of thebinocular fusion brightness image of each frame of the binocular fusionbrightness image in the G_(dis) ^(i) except the first frame of thebinocular fusion brightness image; denoting the mean value of the pixelvalues of all the pixels in the LD_(dis) ^(i,f′) as LD^(i,f′);calculating a brightness difference value of the G_(dis) ^(i) anddenoting the brightness difference value of the G_(dis) ^(i) asLDavg^(i),

${{{LD}{avg}}^{i} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{LD}^{i,f^{\prime}}}{2^{n} - 1}};$

{circle around (7)}-7, obtaining a motion intensity vector of theB_(dis) from the respective motion intensities of all the frame groupsin the B_(dis) in order, and denoting the motion intensity vector of theB_(dis) as V_(MAavg),V_(MAavg) =[MAavg ¹,MAavg², . . . ,MAavg^(i), . . . ,MAavg^(n) ^(GoF) ];

obtaining a brightness difference vector of the B_(dis) from therespective brightness difference values of all the frame groups in theB_(dis) in order, and denoting the brightness difference vector of theB_(dis) as V_(LDavg), V_(LDavg)=[LDavg¹,LDavg², . . . ,LDavg^(i), . . .,LDavg^(n) ^(GoF) ]; wherein:

the MAavg¹, the MAavg², and the MAavg^(n) ^(GoF) respectively representthe motion intensities of a first frame group, a second frame group anda n_(GoF)th frame group in the B_(dis); the LDavg¹, the LDavg², and theLDavg^(n) ^(GoF) respectively represent the brightness difference valuesof the first frame group, the second frame group and the n_(GoF)th framegroup in the B_(dis);

{circle around (7)}-8, processing the MAavg^(i) with a normalizationcalculation, and obtaining a normalized motion intensity of the G_(dis)^(i), denoted as v_(MAavg) ^(norm,i),

${v_{MAavg}^{{norm},i} = \frac{{{MA}{avg}}^{i} - {\max\left( V_{MAavg} \right)}}{{\max\left( V_{MAavg} \right)} - {\min\left( V_{MAavg} \right)}}};$

processing the LDavg^(i) with the normalization calculation, andobtaining a normalized brightness difference value of the G_(dis) ^(i),denoted as v_(LDavg) ^(norm,i),

${v_{LDavg}^{{norm},i} = \frac{{LDavg}^{i} - {\max\left( V_{LDavg} \right)}}{{\max\left( V_{LDavg} \right)} - {\min\left( V_{LDavg} \right)}}};$

wherein the max( ) is a function to find a maximum and the min( ) is afunction to find a minimum; and

{circle around (7)}-9, according to the v_(MAavg) ^(norm,i) and thev_(LDavg) ^(norm,i), calculating the weight w^(i) of the Q_(GoF) ^(i),w^(i)=(1−v_(MAavg) ^(norm,i))×v_(LDavg) ^(norm,i).

Preferably, in the step {circle around (6)}, w_(G)=0.8.

Compared with the conventional technology, the present invention hasfollowing advantages.

Firstly, the present invention fuses the brightness value of the pixelsin the left viewpoint image with the brightness value of the pixels inthe right viewpoint image in the stereoscopic image in a manner ofbinocular brightness information fusion, and obtains the binocularfusion brightness image of the stereoscopic image. The manner ofbinocular brightness information fusion overcomes a difficulty inassessing a stereoscopic perception quality of the stereoscopic videoquality assessment to some extent and effectively increases an accuracyof the stereoscopic video objective quality assessment.

Secondly, the present invention applies the three-dimensional wavelettransform in the stereoscopic video quality assessment. Each frame groupin the binocular fusion brightness image video is processed with theone-level three-dimensional wavelet transform, and video time-domaininformation is described through a wavelet domain decomposition, whichsolves a difficulty in describing the video time-domain information tosome extent and effectively increases the accuracy of the stereoscopicvideo objective quality assessment.

Thirdly, when weighing the quality of each frame group in the binocularfusion brightness image video corresponding to the distortedstereoscopic video, the method provided by the present invention fullyconsiders a sensitivity degree of a human eye visual characteristic tovarious kinds of information in the video, and determines the weight ofeach frame group based on the motion intensity and the brightnessdifference. Thus, the stereoscopic video quality assessment method,provided by the present invention, is more conform to a human eyesubjective perception characteristic.

These and other objectives, features, and advantages of the presentinvention will become apparent from the following detailed description,the accompanying drawings, and the appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The figure is an implementation block diagram of an objective assessmentmethod for a stereoscopic video quality based on a wavelet transformaccording to a preferred embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

The present invention is further described with an accompanying drawingand a preferred embodiment of the present invention.

According to a preferred embodiment of the present invention, thepresent invention provides an objective assessment method for astereoscopic video quality based on a wavelet transform, wherein animplementation block diagram thereof is showed in the figure, comprisingsteps of:

{circle around (1)} representing an original undistorted stereoscopicvideo by V_(org), and representing a distorted stereoscopic videoto-be-assessed by V_(dis);

{circle around (2)} calculating a binocular fusion brightness of eachpixel in each frame of a stereoscopic image of the V_(org); denoting thebinocular fusion brightness of a first pixel having coordinates of (u,v)in an f th frame of the stereoscopic image of the V_(org) as B_(org)^(f)(u,v), B_(org) ^(f)(u,v)=√{square root over ((I_(org)^(R,f)(u,v))²+(I_(org) ^(L,f)(u,v))²+2(I_(org) ^(R,f)(u,v)×I_(org)^(L,f)(u,v)×cos ∂)×λ)}; then according to the respective binocularfusion brightnesses of all the pixels in each frame of the stereoscopicimage of the V_(org), obtaining a binocular fusion brightness image ofeach frame of the stereoscopic image in the V_(org); denoting thebinocular fusion brightness image of the f th frame of the stereoscopicimage in the V_(org) as B_(org) ^(f), wherein a second pixel having thecoordinates of (u,v) in the B_(org) ^(f) has a pixel value of theB_(org) ^(f)(u,v); according to the respective binocular fusionbrightness images of all the stereoscopic images in the V_(org),obtaining a binocular fusion brightness image video corresponding to theV_(org), denoted as B_(org), wherein an f th frame of the binocularfusion brightness image in the B_(org) is the B_(org) ^(f); and

calculating a binocular fusion brightness of each pixel in each frame ofa stereoscopic image of the V_(dis); denoting the binocular fusionbrightness of a third pixel having the coordinates of (u,v) in an f thframe of the stereoscopic image of the V_(dis) as B_(dis) ^(f)(u,v),B_(dis) ^(f)(u,v)=√{square root over ((I_(dis) ^(R,f)(u,v))²+(I_(dis)^(L,f)(u,v))²+2(I_(dis) ^(R,f)(u,v)×I_(dis) ^(L,f)(u,v)×cos ∂)×λ)}; thenaccording to the respective binocular fusion brightnesses of all thepixels in each frame of the stereoscopic image of the V_(dis), obtaininga binocular fusion brightness image of each frame of the stereoscopicimage in the V_(dis); denoting the binocular fusion brightness image ofthe f th frame of the stereoscopic image in the V_(dis) as B_(dis) ^(f),wherein a fourth pixel having the coordinates of (u,v) in the B_(dis)^(f) has a pixel value of the B_(dis) ^(f)(u,v); according to therespective binocular fusion brightness images of all the stereoscopicimages in the V_(dis), obtaining a binocular fusion brightness imagevideo corresponding to the V_(dis), denoted as B_(dis), wherein an f thframe of the binocular fusion brightness image in the B_(dis) is theB_(dis) ^(f); wherein:

1≦f≦N_(f), wherein the f has an initial value of 1; the N_(f) representsa total frame number of the stereoscopic images respectively in theV_(org) and the V_(dis); 1≦u≦U,1≦v≦V, wherein the U represents a widthof the stereoscopic image respectively in the V_(org) and the V_(dis),and the V represents a height of the stereoscopic image respectively inthe V_(org) and the V_(dis); the I_(org) ^(R,f)(u,v) represents abrightness value of a fifth pixel having the coordinates of (u,v) in aright viewpoint image of the f th frame of the stereoscopic image of theV_(org); the I_(org) ^(L,f)(u,v) represents a brightness value of asixth pixel having the coordinates of (u,v) in a left viewpoint image ofthe f th frame of the stereoscopic image of the V_(org); the I_(dis)^(R,f)(u,v) represents a brightness value of a seventh pixel having thecoordinates of (u,v) in a right viewpoint image of the f th frame of thestereoscopic image of the V_(dis); the I_(dis) ^(L,f)(u,v) represents abrightness value of an eighth pixel having the coordinates of (u,v) in aleft viewpoint image of the f th frame of the stereoscopic image of theV_(dis); the ∂ represents a fusion angle, wherein it is embodied that∂=120° herein; and the λ represents a brightness parameter of a display,wherein it is embodied that λ=1 herein;

{circle around (3)} adopting 2^(n) frames of the binocular fusionbrightness images as a frame group; respectively dividing the B_(org)and the B_(dis) into n_(GoF) frame groups; denoting an i th frame groupin the B_(org) as G_(org) ^(i); and denoting an i th frame group in theB_(dis) as G_(dis) ^(i); wherein: the n is an integer in a range of[3,5], wherein it is embodied that n=4 herein, namely adopting sixteenframes of the binocular fusion brightness images as the frame group;during a practical implementation, if a frame number of the binocularfusion brightness images respectively in the B_(org) and the B_(dis) isnot a positive integral multiple of 2^(n), after orderly dividing thebinocular fusion brightness images into a plurality of the frame groups,the redundant frames of the binocular fusion brightness images are notprocessed;

${n_{GoF} = \left\lfloor \frac{N_{f}}{2^{n}} \right\rfloor},$wherein the └ ┘ is a round-down symbol; and 1≦i≦n_(GoF);

{circle around (4)} processing each frame group in the B_(org) with aone-level three-dimensional wavelet transform, and obtaining eightgroups of first sub-band sequences corresponding to each frame group inthe B_(org), wherein: the eight groups of the first sub-band sequencescomprise four groups of first time-domain high-frequency sub-bandsequences and four groups of first time-domain low-frequency sub-bandsequences; and each group of the first sub-band sequence comprises

$\frac{2^{n}}{2}$first wavelet coefficient matrixes; herein, the four groups of the firsttime-domain high-frequency sub-band sequences corresponding to eachframe group in the B_(org) are respectively an original time-domainhigh-frequency approximate sequence HLL_(org), an original time-domainhigh-frequency horizontal detail sequence HLH_(org), an originaltime-domain high-frequency vertical detail sequence HHL_(org), and anoriginal time-domain high-frequency diagonal detail sequence HHH_(org);and the four groups of the first time-domain low-frequency sub-bandsequences corresponding to each frame group in the B_(org) arerespectively an original time-domain low-frequency approximate sequenceLLL_(org), an original time-domain low-frequency horizontal detailsequence LLH_(org), an original time-domain low-frequency verticaldetail sequence LHL_(org), and an original time-domain low-frequencydiagonal detail sequence LHH_(org); and

processing each frame group in the B_(dis) with the one-levelthree-dimensional wavelet transform, and obtaining eight groups ofsecond sub-band sequences corresponding to each frame group in theB_(dis), wherein: the eight groups of the second sub-band sequencescomprise four groups of second time-domain high-frequency sub-bandsequences and four groups of second time-domain low-frequency sub-bandsequences; each group of the second sub-band sequence comprises

$\frac{2^{n}}{2}$second wavelet coefficient matrixes; herein, the four groups of thesecond time-domain high-frequency sub-band sequences corresponding toeach frame group in the B_(dis) are respectively a distorted time-domainhigh-frequency approximate sequence HLL_(dis), a distorted time-domainhigh-frequency horizontal detail sequence HLH_(dis), a distortedtime-domain high-frequency vertical detail sequence HHL_(dis), and adistorted time-domain high-frequency diagonal detail sequence HHH_(dis);and the four groups of the second time-domain low-frequency sub-bandsequences corresponding to each frame group in the B_(dis) arerespectively a distorted time-domain low-frequency approximate sequenceLLL_(dis), a distorted time-domain low-frequency horizontal detailsequence LLH_(dis), a distorted time-domain low-frequency verticaldetail sequence LHL_(dis), and a distorted time-domain low-frequencydiagonal detail sequence LHH_(dis); wherein:

in the present invention, the binocular fusion brightness image videosare processed with a time-domain decomposition through thethree-dimensional wavelet transform; video time-domain information isdescribed based on frequency components; and to finish processing thetime-domain information in a wavelet domain solves a difficulty of atime-domain quality assessment in the video quality assessment to someextent and increases an accuracy of the assessment method;

{circle around (5)} calculating respective qualities of two groups amongthe eight groups of the second sub-band sequences corresponding to eachframe group in the B_(dis); and denoting a quality of a j th group ofthe second sub-band sequence corresponding to the G_(dis) ^(i) asQ^(i,j),

$Q^{i,j},{Q^{i,j} = \frac{\sum\limits_{k = 1}^{K}{{SSIM}\left( {{VI}_{org}^{i,j,k},{VI}_{dis}^{i,j,k}} \right)}}{K}},$wherein:

j=1,5, wherein: a first group of the second sub-band sequencecorresponding to the G_(dis) ^(i) is a first group of the secondtime-domain high-frequency sub-band sequence corresponding to theG_(dis) ^(i) when j=1; and a fifth group of the second sub-band sequencecorresponding to the G_(dis) ^(i) is a first group of the secondtime-domain low-frequency sub-band sequence corresponding to the G_(dis)^(i) when j=5;

1≦k≦K, wherein: the K represents a total number of the waveletcoefficient matrixes respectively in each group of the first sub-bandsequence corresponding to each frame group in the B_(org) and each groupof the second sub-band sequence corresponding to each frame group in theB_(dis); and

${K = \frac{2^{n}}{2}};$

the VI_(org) ^(i,j,k) represents a k th first wavelet coefficient matrixof a j th group of the first sub-band sequence corresponding to theG_(org) ^(i);

the VI_(dis) ^(i,j,k) represents a k th second wavelet coefficientmatrix of the j th group of the second sub-band sequence correspondingto the G_(dis) ^(i); and

SSIM( ) is a structural similarity calculation function,

${{{SSIM}\left( {{VI}_{org}^{i,j,k},{VI}_{dis}^{i,j,k}} \right)} = \frac{\left( {{2\mu_{org}\mu_{dis}} + c_{1}} \right)\left( {{2\sigma_{{org} - {dis}}} + c_{2}} \right)}{\left( {\mu_{org}^{2} + \mu_{dis}^{2} + c_{1}} \right)\left( {\sigma_{org}^{2} + \sigma_{dis}^{2} + c_{2}} \right)}},$wherein: the μ_(org) represents a mean value of values of all elementsin the VI_(org) ^(i,j,k); the μ_(dis) represents a mean value of valuesof all elements in the VI_(dis) ^(i,j,k); the σ_(org) represents avariance of the VI_(org) ^(i,j,k); the σ_(dis) represents a variance ofthe VI_(dis) ^(i,j,k); the σ_(org-dis) represents a covariance betweenthe VI_(org) ^(i,j,k) and the VI_(dis) ^(i,j,k); both the c₁ and the c₂are constants; the c₁ and the c₂ prevents a denominator from being 0;and it is embodied that c₁=0.05 and c₂=0.05 herein;

{circle around (6)} according to the respective qualities of two groupsamong the eight groups of the second sub-band sequences corresponding toeach frame group in the B_(dis), calculating a quality of each framegroup in the B_(dis); and denoting the quality of the G_(dis) ^(i) asQ_(GoF) ^(i), Q_(GoF) ^(i)=w_(G)×Q^(i,1)+(1−w_(G))×Q^(i,5), wherein: thew_(G) is a weight of the Q^(i,1), wherein it is embodied that w_(G)=0.8herein; the Q^(i,1) represents the quality of the first group of thesecond sub-band sequence corresponding to the G_(dis) ^(i), namely thequality of the first group of the second time-domain high-frequencysub-band sequence corresponding to the G_(dis) ^(i); the Q^(i,5)represents the quality of the fifth group of the second sub-bandsequence corresponding to the G_(dis) ^(i), namely the quality of thefirst group of the second time-domain low-frequency sub-band sequencecorresponding to the G_(dis) ^(i); and

{circle around (7)} according to the quality of each frame group in theB_(dis), calculating an objective assessment quality of the V_(dis) anddenoting the objective assessment quality of the V_(dis) as Q_(v),

${Q_{v} = \frac{\sum\limits_{i = 1}^{n_{GoF}}{w^{i} \times Q_{GoF}^{i}}}{\sum\limits_{i = 1}^{n_{GoF}}w^{i}}},$wherein: the w^(i) is a weight of the Q_(GoF) ^(i); and it is embodiedthat the w^(i) is obtained through following steps:

{circle around (7)}-1, calculating a motion vector of each pixel in eachframe of the binocular fusion brightness image of the G_(dis) ^(i)except a first frame of the binocular fusion brightness image, with areference to a previous frame of the binocular fusion brightness imageof each frame of the binocular fusion brightness image in the G_(dis)^(i) except the first frame of the binocular fusion brightness image;

{circle around (7)}-2, according to the motion vector of each pixel ineach frame of the binocular fusion brightness image of the G_(dis) ^(i)except the first frame of the binocular fusion brightness image,calculating a motion intensity of each frame of the binocular fusionbrightness image in the G_(dis) ^(i) except the first frame of thebinocular fusion brightness image; and denoting the motion intensity ofan f′th frame of the binocular fusion brightness image in the G_(dis)^(i) as MA^(f′),

${{MA}^{f^{\prime}} = {\frac{1}{U \times V}{\sum\limits_{s = 1}^{U}{\sum\limits_{t = 1}^{V}\left( {\left( {{mv}_{x}\left( {s,t} \right)} \right)^{2} + \left( {{mv}_{y}\left( {s,t} \right)} \right)^{2}} \right)}}}},$wherein: 2≦f′≦2^(n); the f′ has an initial value of 2; 1≦s≦U,1≦t≦V; themv_(x)(s,t) represents a horizontal component of the motion vector of apixel having coordinates of (s,t) in the f′th frame of the binocularfusion brightness image in the G_(dis) ^(i); and the mv_(y)(s,t)represents a vertical component of the pixel having the coordinates of(s,t) in the f′th frame of the binocular fusion brightness image in theG_(dis) ^(i);

{circle around (7)}-3, calculating a motion intensity of the G_(dis)^(i), denoted as MAavg^(i),

${{{MA}\;{avg}^{i}} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{MA}^{f^{\prime}}}{2^{n} - 1}};$

{circle around (7)}-4, calculating a background brightness image of eachframe of the binocular fusion brightness image in the G_(dis) ^(i);denoting the background brightness image of an f″th frame of thebinocular fusion brightness image in the G_(dis) ^(i) as BL_(dis)^(i,f″); and denoting a pixel value of a first pixel having coordinatesof (p,q) in the BL_(dis) ^(i,f″) as BL_(dis) ^(i,f″)(p,q),

${{{BL}_{dis}^{i,f^{''}}\left( {p,q} \right)} = {\frac{1}{32}{\sum\limits_{{bi} = {- 2}}^{2}{\overset{2}{\sum\limits_{{bj} = {- 2}}}{{I_{dis}^{i,f^{''}}\left( {{p + {bi}},{q + {bi}}} \right)} \times {{BO}\left( {{{bi} + 3},{{bj} + 3}} \right)}}}}}},$wherein: 1≦f″≦2^(n); 3≦p≦U−2,3≦q≦V−2; −2≦bi≦2,−2≦bj≦2; the I_(dis)^(i,f″)(p+bi,q+bi) represents a pixel value of a pixel havingcoordinates of (p+bi,q+bi) in the f″th frame of the binocular fusionbrightness image of the G_(dis) ^(i); and the BO(bi+3,bj+3) representsan element at a subscript of (bi+3,bj+3) in a 5×5 background brightnessoperator, wherein it is embodied that the 5×5 background brightnessoperator herein is

$\begin{bmatrix}1 & 1 & 1 & 1 & 1 \\1 & 2 & 2 & 2 & 1 \\1 & 2 & 0 & 2 & 1 \\1 & 2 & 2 & 2 & 1 \\1 & 1 & 1 & 1 & 1\end{bmatrix};$

{circle around (7)}-5, calculating a brightness difference image betweeneach frame of the binocular fusion brightness image and the previousframe of the binocular fusion brightness image of each frame of thebinocular fusion brightness image in the G_(dis) ^(i) except the firstframe of the binocular fusion brightness image; denoting the brightnessdifference image between the f′th frame of the binocular fusionbrightness image in the G_(dis) ^(i) and an f′−1th frame of thebinocular fusion brightness image in the G_(dis) ^(i) as LD_(dis)^(i,f′); and denoting a pixel value of a second pixel having thecoordinates of (p,q) in the LD_(dis) ^(i,f′) as LD_(dis) ^(i,f′)(p,q),LD _(dis) ^(i,f′)(p,q)=(I _(dis) ^(i,f′)(p,q)−I _(dis) ^(i,f′-1)(p,q)+BL_(dis) ^(i,f′)(p,q)−BL _(dis) ^(i,f′-1)(p,q))/2,

wherein: 2≦f′≦2^(n); 3≦p≦U−2,3≦q≦V−2; the I_(dis) ^(i,f′)(p,q)represents a pixel value of a third pixel having the coordinates of(p,q) in the f′th frame of the binocular fusion brightness image in theG_(dis) ^(i); the I_(dis) ^(i,f′-1)(p,q) represents a pixel value of afourth pixel having the coordinates of (p,q) in the f′−1th frame of thebinocular fusion brightness image in the G_(dis) ^(i); the BL_(dis)^(i,f′)(p,q) represents a pixel value of a fifth pixel having thecoordinates of (p,q) in the background brightness image BL_(dis) ^(i,f″)of the f′th frame of the binocular fusion brightness image of theG_(dis) ^(i); and the BL_(dis) ^(i,f′-1)(p,q) represents a pixel valueof a sixth pixel having the coordinates of (p,q) in the backgroundbrightness image BL_(dis) ^(i,f′-1) of the f′−1th frame of the binocularfusion brightness image of the G_(dis) ^(i);

{circle around (7)}-6, calculating a mean value of the pixel values ofall the pixels in the brightness difference image between each frame ofthe binocular fusion brightness image and the previous frame of thebinocular fusion brightness image of each frame of the binocular fusionbrightness image in the G_(dis) ^(i) except the first frame of thebinocular fusion brightness image; denoting the mean value of the pixelvalues of all the pixels in the LD_(dis) ^(i,f′) as LD^(i,f′);calculating a brightness difference value of the G_(dis) ^(i) anddenoting the brightness difference value of the G_(dis) ^(i) asLDavg^(i),

${{LDavg}^{i} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{LD}^{i,f^{\prime}}}{2^{n} - 1}};$

{circle around (7)}-7, obtaining a motion intensity vector of theB_(dis) from the respective motion intensities of all the frame groupsin the B_(dis) in order, and denoting the motion intensity vector of theB_(dis) as V_(MAavg),V_(MAavg)=[MAavg¹,MAavg², . . . ,MAavg^(i), . . . ,MAavg^(n) ^(GoF) ];

obtaining a brightness difference vector of the B_(dis) from therespective brightness difference values of all the frame groups in theB_(dis) in order, and denoting the brightness difference vector of theB_(dis) as V_(LDavg),V_(LDavg)=[LDavg¹,LDavg², . . . ,LDavg^(i), . . . ,LDavg^(n) ^(GoF) ];wherein:

the MAavg¹, the MAavg², and the MAavg^(n) ^(GoF) respectively representthe motion intensities of a first frame group, a second frame group anda n_(GoF)th frame group in the B_(dis); the LDavg¹, the LDavg², and theLDavg^(n) ^(GoF) respectively represent the brightness difference valueof the first frame group, the second frame group and the n_(GoF)th framegroup in the B_(dis);

{circle around (7)}-8, processing the MAavg^(i) with a normalizationcalculation, and obtaining a normalized motion intensity of the G_(dis)^(i), denoted as v_(MAavg) ^(norm,i),

${v_{MAavg}^{{norm},i} = \frac{{MAavg}^{i} = {- {\max\left( V_{MAavg} \right)}}}{{\max\left( V_{MAavg} \right)} - {\min\left( V_{MAavg} \right)}}};$

processing the LDavg^(i) with the normalization calculation, andobtaining a normalized brightness difference value of the G_(dis) ^(i),denoted as v_(LDavg) ^(norm,i),

${v_{LDavg}^{{norm},i} = \frac{{LDavg}^{i} - {\max\left( V_{LDavg} \right)}}{{\max\left( V_{LDavg} \right)} - {\min\left( V_{LDavg} \right)}}};$

wherein the max( ) is a function to find a maximum and the min( ) is afunction to find a minimum; and

{circle around (7)}-9, according to the v_(MAavg) ^(norm,i) and thev_(LDavg) ^(norm,i), calculating the weight w^(i) of the Q_(GoF) ^(i),w^(i)=(1−v_(MAavg) ^(norm,i))×v_(LDavg) ^(norm,i).

In order to illustrate effectiveness and feasibility of the methodprovided by the present invention, a NAMA3DS1-CoSpaD1 stereoscopic videodatabase (NAMA3D video database in short) provided by a French IRCCyNresearch institution is adopted for a verification test, for analyzing acorrelation between an objective assessment result of the methodprovided by the present invention and a difference mean opinion score(DMOS). The NAMA3D video database comprises 10 original high-definitionstereoscopic videos showing different scenes. Each originalhigh-definition stereoscopic video is treated with an H.264 codingcompression distortion or a JPEG2000 coding compression distortion. TheH.264 coding compression distortion has 3 different distortion degrees,namely totally 30 first distorted stereoscopic videos; and the JPEG2000coding compression distortion has 4 different distortion degrees, namelytotally 40 second distorted stereoscopic videos. Through the steps{circle around (1)}-{circle around (7)} of the method provided by thepresent invention, the above 70 distorted stereoscopic videos arecalculated in the same manner to obtain an objective assessment qualityof each distorted stereoscopic video relative to a correspondingundistorted stereoscopic video; then the objective assessment quality ofeach distorted stereoscopic video is processed through a four-parameterLogistic function non-linear fitting with the DMOS; and finally, aperformance index value between the objective assessment result and asubjective perception is obtained. Herein, three common objectiveparameters for assessing a video quality assessment method serve asassessment indexes. The three objective parameters are respectivelyCorrelation coefficient (CC), Spearman Rank Order Correlationcoefficient (SROCC) and Rooted Mean Squared Error (RMSE). A range of thevalue of the CC and the SROCC is [0, 1]. The nearer a value approximatesto 1, the more accurate an objective assessment method is; otherwise,the objective assessment method is less accurate. The smaller RMSE, thehigher accuracy of a predication of the objective assessment method, andthe better performance of the objective assessment method; otherwise,the predication of the objective assessment method is worse. Theassessment indexes, CC, SROCC and RMSE, for representing the performanceof the method provided by the present invention are listed in Table 1.According to data listed in the Table 1, the objective assessmentquality of the distorted stereoscopic video, which is obtained throughthe method provided by the present invention, has a good correlationwith the DMOS. For H.264 coding compression distorted videos, the CCreaches 0.8712; the SROCC reaches 0.8532; and the RMSE is as low as5.7212. For JPEG2000 coding compression distorted videos, the CC reaches0.9419; the SROCC reaches 0.9196; and the RMSE is as low as 4.1915. Foran overall distorted video comprising both the H.264 coding compressiondistorted videos and the JPEG2000 coding compression distorted videos,the CC reaches 0.9201; the SROCC reaches 0.8910; and the RMSE is as lowas 5.0523. Thus, the objective assessment result of the method providedby the present invention is relatively consistent with a human eyesubjective perception result, which fully proves the effectiveness ofthe method provided by the present invention.

TABLE 1 Correlation between objective assessment quality of distortedstereoscopic video calculated through method provided by presentinvention and DMOS CC SROCC RMSE 30 H.264 coding compression 0.87120.8532 5.7212 stereoscopic videos 40 JPEG2000 coding compression 0.94190.9196 4.1915 stereoscopic videos Totally 70 distorted stereoscopic0.9201 0.8910 5.0523 videos

One skilled in the art will understand that the embodiment of thepresent invention as shown in the drawings and described above isexemplary only and not intended to be limiting.

It will thus be seen that the objects of the present invention have beenfully and effectively accomplished. Its embodiments have been shown anddescribed for the purposes of illustrating the functional and structuralprinciples of the present invention and is subject to change withoutdeparture from such principles. Therefore, this invention includes allmodifications encompassed within the spirit and scope of the followingclaims.

What is claimed is:
 1. An objective assessment method for a stereoscopicvideo quality based on a wavelet transform, comprising steps of: {circlearound (1)} representing an original undistorted stereoscopic video byV_(org), and representing a distorted stereoscopic video to-be-assessedby V_(dis); {circle around (2)} calculating a binocular fusionbrightness of each pixel in each frame of a stereoscopic image of theV_(org); denoting the binocular fusion brightness of a first pixelhaving coordinates of (u,v) in an f th frame of the stereoscopic imageof the V_(org) as B_(org) ^(f)(u,v), B_(org) ^(f)(u,v)=√{square rootover ((I_(org) ^(R,f)(u,v))²+(I_(org) ^(L,f)(u,v))²+2(I_(org)^(R,f)(u,v)×I_(org) ^(L,f)(u,v)×cos ∂)×λ)}; then according to therespective binocular fusion brightnesses of all the pixels in each frameof the stereoscopic image of the V_(org), obtaining a binocular fusionbrightness image of each frame of the stereoscopic image in the V_(org);denoting the binocular fusion brightness image of the f th frame of thestereoscopic image in the V_(org) as B_(org) ^(f), wherein a secondpixel having the coordinates of (u,v) in the B_(org) ^(f) has a pixelvalue of the B_(org) ^(f)(u,v); according to the respective binocularfusion brightness images of all the stereoscopic images in the V_(org),obtaining a binocular fusion brightness image video corresponding to theV_(org), denoted as B_(org), wherein an f th frame of the binocularfusion brightness image in the B_(org) is the B_(org) ^(f); andcalculating a binocular fusion brightness of each pixel in each frame ofa stereoscopic image of the V_(dis); denoting the binocular fusionbrightness of a third pixel having the coordinates of (u,v) in an f thframe of the stereoscopic image of the V_(dis) as B_(dis) ^(f)(u,v),B_(dis) ^(f)(u,v)=√{square root over ((I_(dis) ^(R,f)(u,v))²+(I_(dis)^(L,f)(u,v))²+2(I_(dis) ^(R,f)(u,v)×I_(dis) ^(L,f)(u,v)×cos ∂)×λ)}; thenaccording to the respective binocular fusion brightnesses of all thepixels in each frame of the stereoscopic image of the V_(dis), obtaininga binocular fusion brightness image of each frame of the stereoscopicimage in the V_(dis); denoting the binocular fusion brightness image ofthe f th frame of the stereoscopic image in the V_(dis) as B_(dis) ^(f),wherein a fourth pixel having the coordinates of (u,v) in the B_(dis)^(f) has a pixel value of the B_(dis) ^(f)(u,v); according to therespective binocular fusion brightness images of all the stereoscopicimages in the V_(dis), obtaining a binocular fusion brightness imagevideo corresponding to the V_(dis), denoted as B_(dis), wherein an f thframe of the binocular fusion brightness image in the B_(dis) is theB_(dis) ^(f); wherein: 1≦f≦N_(f), wherein the f has an initial value of1; the N_(f) represents a total frame number of the stereoscopic imagesrespectively in the V_(org) and the V_(dis); 1≦u≦U,1≦v≦V, wherein the Urepresents a width of the stereoscopic image respectively in the V_(org)and the V_(dis), and the V represents a height of the stereoscopic imagerespectively in the V_(org) and the V_(dis); the I_(org) ^(R,f)(u,v)represents a brightness value of a fifth pixel having the coordinates of(u,v) in a right viewpoint image of the f th frame of the stereoscopicimage of the V_(org); the I_(org) ^(L,f)(u,v) represents a brightnessvalue of a sixth pixel having the coordinates of (u,v) in a leftviewpoint image of the f th frame of the stereoscopic image of theV_(org); the I_(dis) ^(R,f)(u,v) represents a brightness value of aseventh pixel having the coordinates of (u,v) in a right viewpoint imageof the f th frame of the stereoscopic image of the V_(dis); the I_(dis)^(L,f)(u,v) represents a brightness value of an eighth pixel having thecoordinates of (u,v) in a left viewpoint image of the f th frame of thestereoscopic image of the V_(dis); the ∂ represents a fusion angle; andthe λ represents a brightness parameter of a display; {circle around(3)} adopting 2^(n) frames of the binocular fusion brightness images asa frame group; respectively dividing the B_(org) and the B_(dis) inton_(GoF) frame groups; denoting an i th frame group in the B_(org) asG_(org) ^(i); and denoting an i th frame group in the B_(dis) as G_(dis)^(i); wherein: the n is an integer in a range of [3,5];${n_{GoF} = \left\lfloor \frac{N_{f}}{2^{n}} \right\rfloor},$  whereinthe └ ┘ is a round-down symbol; and 1≦i≦n_(GoF); {circle around (4)}processing each frame group in the B_(org) with a one-levelthree-dimensional wavelet transform, and obtaining eight groups of firstsub-band sequences corresponding to each frame group in the B_(org),wherein: the eight groups of the first sub-band sequences comprise fourgroups of first time-domain high-frequency sub-band sequences and fourgroups of first time-domain low-frequency sub-band sequences; and eachgroup of the first sub-band sequence comprises $\frac{2^{n}}{2}$  firstwavelet coefficient matrixes; and processing each frame group in theB_(dis) with the one-level three-dimensional wavelet transform, andobtaining eight groups of second sub-band sequences corresponding toeach frame group in the B_(dis), wherein: the eight groups of the secondsub-band sequences comprise four groups of second time-domainhigh-frequency sub-band sequences and four groups of second time-domainlow-frequency sub-band sequences; and each group of the second sub-bandsequence comprises $\frac{2^{n}}{2}$  second wavelet coefficientmatrixes; {circle around (5)} calculating respective qualities of twogroups among the eight groups of the second sub-band sequencescorresponding to each frame group in the B_(dis); and denoting a qualityof a j th group of the second sub-band sequence corresponding to theG_(dis) ^(i) as$Q^{i,j},{Q^{i,j} = \frac{\sum\limits_{k = 1}^{K}{{SSIM}\left( {{VI}_{org}^{i,j,k},{VI}_{dis}^{i,j,k}} \right)}}{K}},$ wherein: j=1,5; the 1≦k≦K; the K represents a total number of thewavelet coefficient matrixes respectively in each group of the firstsub-band sequence corresponding to each frame group in the B_(org) andeach group of the second sub-band sequence corresponding to each framegroup in the B_(dis); and ${K = \frac{2^{n}}{2}};$  the VI_(org)^(i,j,k) represents a k th first wavelet coefficient matrix of a j thgroup of the first sub-band sequence corresponding to the G_(org) ^(i);the VI_(dis) ^(i,j,k) represents a k th second wavelet coefficientmatrix of the j th group of the second sub-band sequence correspondingto the G_(dis) ^(i); and SSIM( ) is a structural similarity calculationfunction; {circle around (6)} according to the respective qualities oftwo groups among the eight groups of the second sub-band sequencescorresponding to each frame group in the B_(dis), calculating a qualityof each frame group in the B_(dis); and denoting the quality of theG_(dis) ^(i) as Q_(GoF) ^(i), Q_(GoF)^(i)=w_(G)×Q^(i,1)+(1−w_(G))×Q^(i,5), wherein: the w_(G) is a weight ofthe Q^(i,1); the Q^(i,1) represents the quality of a first group of thesecond sub-band sequence corresponding to the G_(dis) ^(i); and theQ^(i,5) represents the quality of a fifth group of the second sub-bandsequence corresponding to the G_(dis) ^(i); and {circle around (7)}according to the quality of each frame group in the B_(dis), calculatingan objective assessment quality of the V_(dis) and denoting theobjective assessment quality of the${V_{dis}\mspace{14mu}{as}\mspace{14mu} Q_{v}},{Q_{v} = \frac{\sum\limits_{i = 1}^{n_{GoF}}{w^{i} \times Q_{GoF}^{i}}}{\sum\limits_{i = 1}^{n_{GoF}}w^{i}}},$ wherein the w^(i) is a weight of the Q_(GoF) ^(i).
 2. The objectiveassessment method for the stereoscopic video quality based on thewavelet transform, as recited in claim 1, wherein the w^(i) in the step{circle around (7)} is obtained through steps of: {circle around (7)}-1,calculating a motion vector of each pixel in each frame of the binocularfusion brightness image of the G_(dis) ^(i) except a first frame of thebinocular fusion brightness image, with a reference to a previous frameof the binocular fusion brightness image of each frame of the binocularfusion brightness image in the G_(dis) ^(i) except the first frame ofthe binocular fusion brightness image; {circle around (7)}-2, accordingto the motion vector of each pixel in each frame of the binocular fusionbrightness image of the G_(dis) ^(i) except the first frame of thebinocular fusion brightness image, calculating a motion intensity ofeach frame of the binocular fusion brightness image in the G_(dis) ^(i)except the first frame of the binocular fusion brightness image; anddenoting the motion intensity of an f′th frame of the binocular fusionbrightness image in the G_(dis) ^(i) as MA^(f′),${{MA}^{f^{\prime}} = {\frac{1}{U \times V}{\sum\limits_{s = 1}^{U}{\sum\limits_{t = 1}^{V}\left( {\left( {{mv}_{x}\left( {s,t} \right)} \right)^{2} + \left( {{mv}_{y}\left( {s,t} \right)} \right)^{2}} \right)}}}};$wherein: 2≦f′≦2^(n); the f′ has an initial value of 2; 1≦s≦U,1≦t≦V; themv_(x)(s,t) represents a horizontal component of the motion vector of apixel having coordinates of (s,t) in the f′th frame of the binocularfusion brightness image in the G_(dis) ^(i); and the mv_(y)(s,t)represents a vertical component of the pixel having the coordinates of(s,t) in the f′th frame of the binocular fusion brightness image in theG_(dis) ^(i); {circle around (7)}-3, calculating a motion intensity ofthe G_(dis) ^(i), denoted as MAavg^(i),${{MAavg}^{i} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{MA}^{f^{\prime}}}{2^{n} - 1}};${circle around (7)}-4, calculating a background brightness image of eachframe of the binocular fusion brightness image in the G_(dis) ^(i);denoting the background brightness image of an f″th frame of thebinocular fusion brightness image in the G_(dis) ^(i) as BL_(dis)^(i,f″); and denoting a pixel value of a first pixel having coordinatesof (p,q) in the BL_(dis) ^(i,f″) as BL_(dis) ^(i,f″)(p,q),${{{BL}_{dis}^{i,f^{''}}\left( {p,q} \right)} = {\frac{1}{32}{\sum\limits_{{bi} = {- 2}}^{2}{\sum\limits_{{bj} = {- 2}}^{2}{{I_{dis}^{i,f^{''}}\left( {{p + {bi}},{q + {bi}}} \right)} \times {{BO}\left( {{{bi} + 3},{{bj} + 3}} \right)}}}}}},$ wherein: 1≦f″≦2^(n); 3≦p≦U−2,3≦q≦V−2; −2≦bi≦2,−2≦bj≦2; the I_(dis)^(i,f″)(p+bi,q+bi) represents a pixel value of a pixel havingcoordinates of (p+bi,q+bi) in the f″th frame of the binocular fusionbrightness image of the G_(dis) ^(i); and the BO(bi+3,bj+3) representsan element at a subscript of (bi+3,bj+3) in a 5×5 background brightnessoperator; {circle around (7)}-5, calculating a brightness differenceimage between each frame of the binocular fusion brightness image andthe previous frame of the binocular fusion brightness image of eachframe of the binocular fusion brightness image in the G_(dis) ^(i)except the first frame of the binocular fusion brightness image;denoting the brightness difference image between the f′th frame of thebinocular fusion brightness image in the G_(dis) ^(i) and an f′−1thframe of the binocular fusion brightness image in the G_(dis) ^(i) asLD_(dis) ^(i,f′); and denoting a pixel value of a second pixel havingthe coordinates of (p,q) in the LD_(dis) ^(i,f′) as LD_(dis)^(i,f′)(p,q),LD _(dis) ^(i,f′)(p,q)=(I _(dis) ^(i,f′)(p,q)−I _(dis) ^(i,f′-1)(p,q)+BL_(dis) ^(i,f′)(p,q)−BL _(dis) ^(i,f′-1)(p,q))/2, wherein: 2≦f′≦2^(n);3≦p≦U−2,3≦q≦V−2; the I_(dis) ^(i,f′)(p,q) represents a pixel value of athird pixel having the coordinates of (p,q) in the f′th frame of thebinocular fusion brightness image in the G_(dis) ^(i); the I_(dis)^(i,f′-1)(p,q) represents a pixel value of a fourth pixel having thecoordinates of (p,q) in the f′−1th frame of the binocular fusionbrightness image in the G_(dis) ^(i); the BL_(dis) ^(i,f′)(p,q)represents a pixel value of a fifth pixel having the coordinates of(p,q) in the background brightness image BL_(dis) ^(i,f″) of the f′thframe of the binocular fusion brightness image of the G_(dis) ^(i); andthe BL_(dis) ^(i,f′-1)(p,q) represents a pixel value of a sixth pixelhaving the coordinates of (p,q) in the background brightness imageBL_(dis) ^(i,f′-1) of the f′−1th frame of the binocular fusionbrightness image of the G_(dis) ^(i); {circle around (7)}-6, calculatinga mean value of the pixel values of all the pixels in the brightnessdifference image between each frame of the binocular fusion brightnessimage and the previous frame of the binocular fusion brightness image ofeach frame of the binocular fusion brightness image in the G_(dis) ^(i)except the first frame of the binocular fusion brightness image;denoting the mean value of the pixel values of all the pixels in theLD_(dis) ^(i,f′) as LD^(i,f′); calculating a brightness difference valueof the G_(dis) ^(i) and denoting the brightness difference value of theG_(dis) ^(i) as LDavg^(i),${{LDavg}^{i} = \frac{\sum\limits_{f^{\prime} = 2}^{2^{n}}{LD}^{i,f^{\prime}}}{2^{n} - 1}};${circle around (7)}-7, obtaining a motion intensity vector of theB_(dis) from the respective motion intensities of all the frame groupsin the B_(dis) in order, and denoting the motion intensity vector of theB_(dis) as V_(MAavg),V_(MAavg)=[MAavg¹,MAavg², . . . ,MAavg^(i), . . . ,MAavg^(n) ^(GoF) ];obtaining a brightness difference vector of the B_(dis) from therespective brightness difference values of all the frame groups in theB_(dis) in order, and denoting the brightness difference vector of theB_(dis) as V_(LDavg),V_(LDavg)=[LDavg¹,LDavg², . . . ,LDavg^(i), . . . ,LDavg^(n) ^(GoF) ];wherein: the MAavg¹, the MAavg², and the MAavg^(n) ^(GoF) respectivelyrepresent the motion intensities of a first frame group, a second framegroup and a n_(GoF)th frame group in the B_(dis); the LDavg¹, theLDavg², and the LDavg^(n) ^(GoF) respectively represent the brightnessdifference value of the first frame group, the second frame group andthe n_(GoF)th frame group in the B_(dis); {circle around (7)}-8,processing the MAavg^(i) with a normalization calculation, and obtaininga normalized motion intensity of the G_(dis) ^(i), denoted as v_(MAavg)^(norm,i),${v_{MAavg}^{{norm},i} = \frac{{MAavg}^{i} - {\max\left( V_{MAavg} \right)}}{{\max\left( V_{MAavg} \right)} - {\min\left( V_{MAavg} \right)}}};$processing the LDavg^(i) with the normalization calculation, andobtaining a normalized brightness difference value of the G_(dis) ^(i),denoted as v_(LDavg) ^(norm,i),${v_{LDavg}^{{norm},i} = \frac{{LDavg}^{i} - {\max\left( V_{LDavg} \right)}}{{\max\left( V_{LDavg} \right)} - {\min\left( V_{LDavg} \right)}}};$wherein the max( ) is a maximum function and the min( ) is a minimumfunction; and {circle around (7)}-9, according to the v_(MAavg)^(norm,i) and the v_(LDavg) ^(norm,i), calculating the weight w^(i) ofthe Q_(GoF) ^(i), w^(i)=(1−v_(MAavg) ^(norm,i))×v_(LDavg) ^(norm,i). 3.The objective assessment method for the stereoscopic video quality basedon the wavelet transform, as recited in claim 1, wherein: in the step{circle around (6)}, w_(G)=0.8.
 4. The objective assessment method forthe stereoscopic video quality based on the wavelet transform, asrecited in claim 2, wherein: in the step {circle around (6)}, w_(G)=0.8.